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AN INTRODUCTION TO JOININGS IN ERGODIC 

THEORY 

THIERRY DE LA RUE 



Abstract. Since their introduction by Furstenberg P], joinings have 
proved a very powerful tool in ergodic theory. We present here some 
aspects of the use of joinings in the study of measurable dynamical 
systems, emphasizing on 

• the links between the existence of a non trivial common factor and 
the existence of a joining which is not the product measure, 

• how joinings can be employed to provide elegant proofs of classical 
results, 

• how joinings are involved in important questions of ergodic theory, 
such as pointwise convergence or Rohlin's multiple mixing prob- 
lem. 
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1. What are we talking about? 

1.1. Dynamical systems and stationary processes. We call here a dy- 
namical system any quadruple of the form {X, , n,T), where {X,£/,fj,) is 
a Lebesgue space (or equivalently a standard Borel space equipped with a 
probability measure fi) and T an automorphism of {X, £/, T is a one-to- 
one measurable transformation of X satisfying, for any measurable subset 
A of X, 

^l{T-^A) = f,{TA) = fiiA). 

Throughout this text, we will often use simply the symbol T to designate the 
dynamical system {X, £/, fj,,T). We will also often need a second dynamical 
system S, which will stand for the quadruple (Y, u, S) 

Such dynamical systems, which are the objects of interest in measurable 
ergodic theory, can be considered from a rather probabilistic viewpoint by 
studying stationary processes. Indeed, any measurable map ^ X — > S 
gives rise to a stationary process ^ = (Cfc)fcGZ defined by := ° (The 
set E in which ^ takes its values could be any standard Borel space; most 
of the time, it will be a finite or countable alphabet.) Conversely, to any 

valued stationary process ^ = {£,k)ke'E: we can associate in a canonical 
way a dynamical system: take X^ := E'^, the sample space of the whole 
process, equipped with its Borel cr-algebra ^ and the law //^ of the process. 
Since ^ is stationary, /xg is invariant by the shift T^: {xk)kez ' — > {x'f^)ke'E 
where x'^ := Xk+i- 

Definition 1.1. In the dynamical system {X, £/ , iJ,,T), the stationary pro- 
cess £, = {^0 ° 2^^)fcGZ called a generating process if the a-algebra it gen- 
erates is, modulo fi, the whole a-algebra £/. 

Observe that in the dynamical system (X^, =2^, /x^, Tg), the process (ob- 
tained by looking at the coordinates) is always a generating process. 

The two dynamical systems T and S are said to be isomorphic if we 
can find invariant subsets"^ Xq C X, Yq C Y, with n{Xo) = i^{Yo) = 1, 
and a measurable one-to-one map ■0 '■ ~^ ^ with, for all B € 

n{il)~^B) = ^{B) and such that 5 o -i/; = -0 o T. If ^ is a generating process 
for the dynamical system T, then T is isomorphic to the dynamical system 

defined above. Recall the following well-known result of ergodic theory, 
which shows that studying dynamical systems is essentially the same thing 
as studying stationary processes taking values in a countable alphabet. (A 
nice proof can be found in jlOj.^ 

Theorem 1.2. Every dynamical system admits a generating process taking 
its values in a countable alphabet. 



In the sequel, we will not explicitely mention this restriction to subsets of full measure; 
it should be understood that every map is really defined on a subset of probability 1. 
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1.2. Products and factors: arithmetic of dynamical systems. In his 

famous article initiating the theory of joinings [H], Furstenberg observes that 
a kind of arithmetic can be done with dynamical systems: Indeed, there are 
two natural operations in ergodic theory which present some analogy with 
the integers. 

First, if we are given two dynamical systems {X, , /j,, T) and {Y, v, S), 
we can construct their direct product {X x Y,£/ ® SS^ v^T x S) where 
T X S : {x,y) i — > {Tx,Sy). Note that the Cartesian product X x Y is 
here equipped with the direct product of the probability measures fi and , 
which is always T x S'-invariant. Acting on equivalence classes of isomorphic 
dynamical systems, the direct product operation is commutative, associa- 
tive, and possesses a neutral element which is the trivial system reduced to 
one singleton. Translated in the context of stationary processes, if ^ and C 
are respectively S-valued and F-valued stationary processes (not necessar- 
ily defined on the same probability space) , their direct product ^ (8) C is the 
E X F-valued stationary process whose law is the direct product of the laws 
of ^ and C- That is to say, we are making the two processes ^ and C live 
together in an independant (hence stationary) way. 

Second, we say that (Y, u, S) is a factor of {X, £/, /i, T) if we can find 
a measurable map vr : X ^ Y satisfying 

• 7r{fi) = u ; 

• TT oT = S O IT. 

Such a map is called a homomorphism of dynamical systems. Heuristically, 
the existence of such a homomorphism means that we can see the system 5 
inside the system T by looking at tt{x). To this factor is associated the factor 
a -algebra 7r~^(^). This fi-algebra has the property that if A is measurable 
with respect to 7r~^(=^), then so are T{A) and T~^{A). Conversely, each 
sub-(T-algebra ^ of which is invariant by T and is a factor u-algebra. 
Indeed, such a a-algebra is always generated by some stationary process ^ 
living in the system T (this is an adaptation of Theorem 11.21 to the case of 
a factor cr-algebra), and the dynamical system associated to ^ is a factor 
of T: Take 7r(x) :=(... ,C-~i{x),^o{x),^i{x), . . .). 

Clearly, both systems S and T are factors of the direct product T x S: 
These systems are seen by looking at the two coordinates in the Cartesian 
product. The analogy with the integers introduced by Furstenberg now gives 
two ways to define "T and S are relatively prime" . 

Property 1. T and S have no common factor except the trivial system 
reduced to one point. 

Property 2. Each time T and S appear as factors in some dynamical sys- 
tem, then their direct product T x S also appears as a factor^. 

^To be exact, Property|5|should be stated in the following, stronger form: Each time T 
and 5* appear as factors in some dynamical system through the respective homomorphisms 
ttt and tvs, T x S also appears as a factor through a homomorphism tttxS such that 



4 



THIERRY DE LA RUE 



The relationships between these two properties are exposed in Section IHI 
Property 121 has been cahed by Furstenberg the disjointness property of T 
and S. The study of disjointness turns out to be quite enhghtening, as it 
refers to all the possible ways that two systems can he seen living together 
inside a third one. As explained in the next subsection, this is precisely the 
theory of joinings. 

The large scale of applications of disjointness and joinings is already vis- 
ible in Furstenberg's article where the concepts of disjointness and joinings 
where introduced. Indeed, Furstenberg's initial motivations ranged from 
the classification of dynamical systems (how disjointness can be used to 
characterized some classes of dynamical systems) to a question in Diophan- 
tine approximation (which multiplicative semigroups of integers S have the 
property that any real number is the limit of a sequence of rational numbers 
with denominators belonging to SI), passing through a filtering problem in 
probability theory (if (^„) and (Cn) are two real-valued stationary processes, 
can we recover (^„) from the knowledge of {^n + Cn)?)- Since Furstenberg's 
article, disjointness and joinings have been widely studied, and many other 
related notions have been introduced. In the present work, we shall mainly 
concentrate on some links between joinings and other ergodic properties of 
dynamical systems. For a more complete treatment of ergodic theory via 
joinings, we refer the readers to Eli Glasner's book [H]. 

1.3. Joinings. 

Definition 1.3. Let {X,£/, ^,T) and {Y,I^,u,S) he two dynamical sys- 
tems. A joining of T and S is a probability measure A on the Cartesian 
product X xY, whose marginals on X and Y are ^ and v respectively, and 
which is invariant by the product transformation T x S. 

The set of joinings of T and S, denoted by J(T, S), is never empty since 
it always contains the product measure ^ ® v. Each A G J{T, S) provides 
a new dynamical system {X xY,£^ ® i^,X,T x S), which will be denoted 
by (T X S)\ to avoid confusion with the direct product T x S. Both T 
and S are factors of (T x S)x. Conversely, if T and S appear as factors 
in a third system (Z, ^ , p, R) through respectively the maps ttx ■ Z ^ X 
and TTy : Z — > y, we can consider the map vr : Z ^ X x Y defined by 
7r(z) := (7rx(z), vTy (z)). Then the probability measure A := tt{p) on X x Y 
satisfies the conditions to be a joining of T and S, and the system (T x S)\ 
is a factor of R. 

Topology on the set of joinings. The set of joinings of T and S is 
equipped with the topology defined by 

(1) Xn > A ^ yA£ j^, yB e-^, Xn{A X B) > X{A x B). 

n— >oo n— »oo 



TTX o TTTxS = TTT and Try o TTTxS ~ TTs, where ttx and Try are the projections on the 
coordinates in the Cartesian product X x Y . 
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This topology is metrizable: An example of distance defining it is given by 
d{X,X') := ^\HAmXBn)-\'{A^xBn)\, 

m,n>0 

where {Am)m>o and (i?n)n>o are countable algebras generating respectively 
the (T-algebras and ^. With this topology, the set of joinings J(T, S) is 
turned into a compact metrizable topological space. It is interesting to ob- 
serve that, when X and Y are compact metric spaces, this topology coincides 
with the weak topology on the set of joinings. 

All these definitions can be extended in an obvious way to the notion of 
joining of a finite or countable family of dynamical systems {Ti)i^j. If all 
the Ti's are copies of a single dynamical system T, we speak of self-joinings 
of T. The set of self-joinings of order k oi T (joinings of k copies of T) is 
denoted by Jk{T), and we simply write J(T) for J2{T). 

Ergodic joinings. We denote by Je{T,S) the set of ergodic joinings of T 
and 5. Since every factor of an ergodic system is still ergodic, a necessary 
condition for Je{T, S) not to be empty is that both T and S be ergodic. The 
following proposition shows that the converse is true. 

Proposition 1.4. Given two ergodic dynamical systems T and S, there 
exists at least one ergodic joining ofT and S. 

Proof. We start from the only joining whose existence is known, A := /x (X" z/. 
This joining may not be ergodic, but in this case we consider its decompo- 
sition in ergodic components: There exists a probability measure P on the 
set of all T x 5-invariant ergodic probability measures such that 

A = j X^dPiio). 

Denoting by A^ (respectively Aj) the marginals of the ergodic component 
A^ on X (respectively Y), we get 

II = J XidPiu;), 

and 

u = I XldPiu;). 

Since fi and u are ergodic, this implies that for P-almost every a;, A^ = 
and A^ = u. In other words, for P-almost every m, X^j is an ergodic joining 
of T and S. □ 

Note that this proof implies a stronger result: When T and S are ergodic, 
every joining of T and S* is a convex combination of ergodic joinings. Since 
an ergodic measure cannot be written as a non trivial convex combination 
of invariant measures, we get the following. 

Proposition 1.5. If T and S are ergodic, the set of ergodic joinings of T 
and S is the set of extremal points in the compact convex set J(T, S). 
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2. From disjointness to isomorphism 



In this section, we present the two extreme cases concerning the joinings 
of two dynamical systems T and S. The first one is when the set of joinings 
J(T, S) is reduced to the singleton u}. Heuristically, this means that 
T and S have nothing in common which could lead to a non-trivial joining 
(what they have in common when they are not disjoint is developped in 
Sections 13. .SI and 13. 4j) . To the opposite, if T and S are isomorphic, we can 
construct very special joinings of T and S", namely the joinings supported 
on graphs of isomorphisms. 

2.1. Product measure and disjointness. 

Definition 2.1. The dynamical systems {X, £/ , fj,,T) and {Y, i', S) are 
disjoint if the product measure ^ ® v is the only joining of T and S. We 
write in this case TJ-S. 

We leave as an exercise to the reader the verification of the equivalence 
of this definition with the strong form of Property [21 By the way, why was 
it necessary to state this strong form? (Hint: There exists a non-trivial 
dynamical system T which is isomorphic to the direct product T x T.) 

We now give the most elementary example of disjointness: The identity 
is disjoint from any ergodic dynamical system. 

Proposition 2.2. If T is the identity map on X and S is any ergodic dy- 
namical system, then TJS. 

Proof. Let A be any joining of T = Id and S. For any A G =e/, B G 3^, 
invariance of A by T x S gives 



Vn > 1, X{A X B) 



X{A X S-''B) 




n-1 



k=0 




n-1 



Since S is ergodic, ^ X]fc=o '^siS^v) converges to ^{B) i/-almost everywhere, 
hence A-almost everywhere. This implies 

A(^ X = ^x{A)v{B), 



hence \ = ^ ® v. 



□ 



Another useful way to state the preceding result is the following: 



Proposition 2.3. Let T and S he two dynamical systems, with S ergodic, 
and A G J(T, S). If X is invariant by Id xS, then \ = ^i. ® u. 
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2.1.1. Application: Furstenberg's multiple recurrence theorem for weakly mix- 
ing transformations. As a nice application of Proposition 12. .SI we give be- 
low an elegant proof of Furstenberg's multiple recurrence theorem in the 
(easy) case of weakly mixing transformations [3]. This proof was given by 
V. Ryzhikov in [22]. Recall that T is weakly mixing if T x T is ergodic, and 
that this implies 

• T X is ergodic for any ergodic system S (see also subsection 14. 1|) : 

• is weakly mixing for any k 0. 

Theorem 2.4. Let T be a weakly mixing dynamical system. For any integer 
k > 1, and any measurable subsets Aq, . . . , Ak, we have 
(2) 
1 " 

-y^fi{AonT~^AinT~^^A2n---nT~^^Ak) > fi{Ao)fi{Ai)--- fi{Ak). 

i=i 

Proof. Consider for any n > 1 the self-joining A,i of T of order k+1 defined 

by 

1 

A„(^oxAix...x^fe) := - V^(Aonr-^^inT-2j^2p...^p-fc,^^)^ 

n ^-^ 

i=i 

Note that the result we want to prove is equivalent to the following transla- 
tion in the language of joinings: 

\n > [I® p,® ■ ■ ■ ® [I. 

By compacity, it is enough to verify that if A is any cluster point of the 
sequence (A„,), then A is the product measure, which we shall prove by 
induction on k. Observe that such a cluster point is always Id xT X X 
• • • X T'^-invariant. Thus Proposition 12.31 gives the result when fc = 1 by 
ergodicity of T. Now take k > 1 and suppose the result is true for k — 
1. This induction hypothesis ensures that the marginal of A on the last k 
coordinates is the product measure. Hence A can be seen as a joining of Id 
with T X T'^ X ■ ■ ■ X T'^ . Since T x x • • • x T'^ is ergodic (because T is 
weakly mixing), Proposition 12.31 shows that the result is still true for k. □ 

2.1.2. Disjointness and classification of systems. As we have already men- 
tioned, one of the reasons for the introduction of disjointness by Furstenberg 
was the classification of classes of dynamical systems. And indeed there are 
many results in this direction: For example, Furstenberg proved that a dy- 
namical system has zero entropy if and only if it is disjoint from any Bernoulli 
dynamical system. It also follows from his work that T is weakly mixing 
if and only if it is disjoint from any rotation on the circle. Some further 
explanations on how disjointness can characterize such classes of dynamical 
systems will be given in Section 13.3.21 (see in particular Theorem 13. 6j) . 
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2.1.3. Disjointness and pointwise convergence of ergodic averages. In |17j 
was introduced the so-called weak disjointness property of dynamical sys- 
tems: 

Definition 2.5. {X, /i, T) and (Y, SS^ S) are said to be weakly disjoint 
if, given any function f in L?'{p) and any function g in L?'{v), there exist a 
set A in £/ and a set B in !M such that 

• fi{A) = u{B) = 1 

• for all X ^ A and for all y ^ B, the sequence 

(3) 

V n=0 / N>0 

converges. 

(In fact it is sufficient to check this statement for only dense families of 
/ and g in their respective spaces.) The main motivation for the intro- 
duction of this property was the study of non-conventional ergodic averages: 
Suppose that S and T act on the same probability space X, and consider 
for / and g in L?'{^) the averages 

^ 1 

-Y,f{T^x)g{S^x). 

k=0 

When do these averages converge ^-a.e.? If we assume that T and S are 
weakly disjoint, then almost-everywhere pointwise convergence holds. 

As we could guess from the name of the property, if T and S are disjoint, 
they are weakly disjoint. The link between joinings and convergence of 
ergodic sums like in (j^J comes from the following remark: We can always 
assume that T and S are continuous transformations of compact metric 
spaces X and Y respectively (indeed, any measurable system is isomorphic 
to such a transformation on a compact metric space: see e.g. Then the 
set of probability measures on X x Y equipped with the topology of weak 
convergence is metric compact. If moreover T and S are ergodic, we can 
easily find subsets Xq C X and Yq C Y with ij-{Xq) = i^(Yq) = 1, such that 
for all {x,y) G Xq x Yq, any cluster point of the sequence 

^ n—l 

6n{x,y) = -^S(T''x,S>'y) 
k=0 

is automatically a joining of T and S. When T and 5 are disjoint, there 
is therefore only one possible cluster point to the sequence 6n{x,y^ which 
is fJL® V. This ensures that for continuous functions / and g, convergence 
of © to the product of the integrals of / and g holds, hence T and S are 
weakly disjoint. 

It is not difficult to show that weak disjointness is really weaker than dis- 
jointness. Indeed, there are many examples of dynamical systems which 
are self- weakly disjoint (weakly disjoint from themselves): For example, 
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irrational rotations and Chacon's transformation are self-weakly disjoint 
(see |17j for details and other examples). However, a non trivial dynam- 
ical system is never disjoint from itself, as we will see in the next paragraph. 

2.2. Joinings supported on graphs. Suppose that 5 is a factor of T, 
with TT : X — > y a homomorphism. Then vr gives rise to the existence of 
a very special joining of T and S, which we denote by A^, and which is 
defined by 

A^(A X B) := fi{AmT-^B). 

Lemma 2.6. The joining Ayr has the following two properties: 

• A^(G) = 1, where G := {(x,7r(x)), x G X} is the graph of vr; 

• {X, 0} (g) ^ C (g) {y, 0} mod A^, where the notation C ^ 
mod A ' means that for each C in the a-algebra , there exists a D 
in the a-algebra Si such that X{CAD) = 0. 

Proof. Take a refining and generating sequence (^n) of finite measurable 
partitions in (Y, u). (That is to say, B^n+i is finer than and if y and 
y' are distinct points in Y , there exists some n for which y and y' do not 
belong to the same atom of this ensures in particular that the a-algebra 
generated by the partitions ^„ is S^.) Then G can be written as 

= n u ^~'(^)x^- 

But for any n, it is clear from the definition of A^ that 

A. I U TT-^S) I = 1, 

which proves that A^r is supported on the graph of vr. For the second prop- 
erty, it is easy to verify that A^r identifies any X x B, where B € with 
TT-^B xY. □ 

It is quite remarkable to see that the converse is true, i.e. we can char- 
acterize the fact that S" is a factor of T by the existence of some special 
joining. 

Proposition 2.7. Suppose that there exists a joining A € J(T, S) such that 

{X, 0} =^ C (g) {Y, 0} mod A. 

Then S is a factor of T and we can End a homomorphism vr : X ^ Y such 
that A is supported on the graph of vr. 

Proof. Take again a refining and generating sequence (.^n) of finite mea- 
surable partitions in (Y, v). For each n, can be identified via A with 
some in X, where {^n) is a refining, but not necessarily generating, 
sequence of partitions in {X, fi), Now, for ^u-almost every x G X, if we de- 
note by Bn the atom of to which x belongs and i?„ the atom of r'^n 
identified via A with Bn, then -^^ i^ ^ singleton. Denote by vr(x) the 
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only element of this set; we let the reader check that x i— 7r(x) satisfies the 
required properties. □ 

If we assume further that T and S are isomorphic (in other words, the 
homomorphism vr from X to y is invertible and vr"^ is also a homomorphism 
of dynamical systems; in this case we will rather use the symbol (p instead of 
vr), the joining A,^ now satisfies {X, ^} ® ^ = ® {Y, 0} mod A,^, where 
the notation ' ^ = ^ mod A' means that we have both d S! mod A 
and ^ C ^ mod A. The analog of Proposition 12.71 is also true, and we 
summarize all these results in the following theorem. 

Theorem 2.8. Let T and S he two dynamical systems. S is a factor of 
T if and only if there exists some joining A € J(T, S) such that {X, 0} (8) 
<Z £^ ® {y, 0} mod A; in this case A is supported on the graph of some 
homomorphism. T and S are isomorphic if and only if there exists some 
joining A € J{T, S) such that {X, 0} (g) ^ = i?/ (g) {Y, 0} mod A; in this case 
A is supported on the graph of some isomorphism. 

2.2.1. Application: isomorphism of discrete- spectrum transformations. There 
are very deep applications of the characterization of isomorphic systems by 
joinings. Krieger's theorem, stating that any dynamical system with en- 
tropy less than log2(fe) is isomorphic to the shift map on {1, . . . , fc} with an 
appropriate invariant measure, and Ornstein's theorem characterizing the 
dynamical systems which are isomorphic to some Bernoulli shift, can both 
be proved using this result (see [22, Chapter 7). Here we illustrate the 
power of Theorem 12 . 81 bv giving Lemahczyk's elegant proof of a well-known 
theorem due to Halmos and von Neumann. This proof is taken from j28j . 
Recall that a system T is said to have discrete spectrum if the subspace of 

spanned by the eigenvectors of / i— > / o T is dense in . 

Theorem 2.9. If T is ergodic and has discrete spectrum, and if S is spec- 
trally isomorphic to T, then T and S are isomorphic. 

Proof. First note that S is ergodic and also has discrete spectrum with the 
same eigenvalues as T, since it is spectrally isomorphic to T. Let A be an 
ergodic joining of T and S. For an eigenvalue a of T, let / be an eigenvector 
associated to a in the system T, and g associated to a in the system S. Then 
both functions (x, y) /(x) and (x, y) i— > g{y) are eigenvectors in (T x S)x. 
Since (T x S)x is an ergodic system, a is a simple eigenvalue in (T x S)x, 
hence {x,y) i-^ g{y) and (x,y) i— /(x) are proportional. This implies that 
(x,y) ^ fix) belongs to L2({X, 0} (g)^. A); but since eigenvectors are dense 
in L^{£/,fi), this in turn implies L^{£/ ® {^,0}, A) C L2({X,0} (g) ^, A). 
For the same reason, the inverse inclusion is also true, which proves that 
{X, %}® 3S = ^ (g {y, 0} mod A, and the two systems are isomorphic. □ 

2.2.2. Self-joinings, commutant and minimal self-joinings. When studying 
self-joinings of a single dynamical system T, we are obviously in a situation 
where the two systems that we want to join together are isomorphic. Now, 
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thinking of what an isomorphism between T and itself is, we see that this has 
to be an automorphism of the Lebesgue space (X,£/,fi), which commutes 
with T. We call commutant ofT, and denote by C{T), the set 

C{T) := {S e Aut{X,£/,n) : S oT = T o S}. 

For each S £ C{T), let us denote by A5 the self-joining of T defined by 

As{A X B) := fi{AnS-^B). 

This leads to an identification of C(T) with some subset of J2{T), namely 
the subset of all A € J2 (T) such that 

^ (g) {X, 0} = {X, 0} ® i;/ mod A. 

Observe that for each T the commutant of T contains at least the powers 
T", n G Z. Besides, note that each joined system (T x T)a5 for S G C(T) is 
isomorphic to T (an isomorphism is given by x 1 — > (x,Sx)). In particular, 
(T X T)a5 is ergodic as soon as T is ergodic. This gives for each ergodic 
T a family of "obvious" ergodic self-joinings: the At^, n G Z. In the case 
where T is weakly mixing, the product measure /u (8> ^ is also considered 
as an obvious ergodic self-joining. In 1979, Rudolph introduced in [22 the 
important notion of "minimal self joinings", which says that T has no other 
ergodic self-joinings of order 2 than the obvious ones. The notion of "fc-fold 
minimal self-joinings" refers to joinings of order k: 

Definition 2.10. For k > 2, we say that an ergodic T has A;-fold minimal 
self-joinings if for each ergodic joining X of k copies of T, we can partition 
the set {1, . . . ,k} of coordinates into two subsets Ji, . . . ,Je such that 

(1) for ji and j2 belonging to the same Ji, the marginal of A on the 
coordinates ji and j2 is some Ayn,- 

(2) for ji G Ji, . . . , j£ G Ji, the coordinates ji ■ ■ ■ ,ji are independent. 

We say that T has minimal self-joinings (MSJ) if T has k-fold minimal 
self-joinings for every k > 2. 

In fact, it has been proved by Glasner, Host and Rudolph W that for a 
weakly mixing T, 3-fold minimal self-joinings implies k-fold minimal self- 
joinings for all k. (See also a proof in HE]-) A very important question 
which remains open in the area is the following: 

Question 2.11. Does there exist a transformation T which has 2-fold but 
not 3- fold minimal self-joinings? 

(This question is closely related to the problem of 2-fold and 3-fold mixing: 
See Section EH) 

There are many examples of dynamical systems with the MSJ property, 
the simplest of which is probably the well known Chacon's transformation 
(see ^). One immediate consequence of this property is that C(T) is re- 
duced to the powers of T. This in turn implies that the transformation T 
has no square root (i.e. there is no automorphism S such that S o S = T). 
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It can be noticed that the first example of a transformation with no square 
root, given by Ornstein in 1970, belongs to the class of mixing rank- 
one^ systems, which all turned out to have the MSJ property, as King 
proved in 1988 Another joining proof of this result has been given 

by Ryzhikov |25j . 

If, in condition of the definition of MSJ, we replace At^ by A^, for 
some S € C(T), we get the weaker, but important, notion of simplicity. In 
the case of joinings of order 2, this property was introduced by Veech j3Uj . 
who proved a useful result on the structure of factor cj-algebras of 2-simple 
systems. The general definition of simplicity was later given by Del Junco 
and Rudolph 0. We also refer to Thouvenot [2H1 for a presentation of 
properties of simple systems. 

As a nice application of joinings to the study of C(T), we should also 
mention here Ryzhikov's proof of King's so-called Weak Closure Theorem, 
stating that if T is rank one, any S in C(T) is the limit of some sequence 
T"'= of the powers ofT 



In this section we discuss the links and the differences between the two 
properties proposed in the introduction to define "T and S are relatively 
prime". The starting point of the theory is the existence of a special joining 
arising when T and S have a common factor. 

3.1. Relatively independent joining above a common factor. We 

first define this special joining in the particular case of two systems which 
obviously have a common factor, since they both arise as a joining of a fixed 
system R with other systems. 

Let {X, £^ , [I, T), {Y, SS, V, S) and (Z, , p, R) be three dynamical systems. 
Suppose that At is a joining of T and R, and Xs is a joining of 5 and R. 
Then we can put all these systems together so that the marginal on {X x Z) 
is Xt and the marginal on (Y x Z) is Xs- First pick z according to the 
probability law p, then pick x and y according to their conditionnal law 
knowing z in the respective joinings At and Xs, but independently of each 
other. More precisely, consider the 3-fold joining of T, S and R denoted by 
At ®i? Xs defined by setting, for all A e £/ , B e ."Sg and C € 



We get in this way a joining of the two systems (T x R)xj, and {S x R)xg 
identifying the coordinate z on both systems. This joining is called the 
relatively independent joining of{TxR)xj, and {SxR)xg above their common 
factor R. 

We shall not define in this text what a rank-one system is; the reader unfamiliar with 
this notion can find a good introduction in |18| 



3. Joinings and factors 
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Figure 1. The relatively independent joining of T and S 
above their common factor R: Given a (common) fixed pro- 
jection on the factor, the joining is the direct product of the 
two conditional measures. 

Now turn to the more general situation of two systems T and S sharing a 
common factor R. This means that we have homomorphisms of dynamical 
systems ttt : X — > Z and 'ks '■ Y ^ Z, which give us special joinings 
G J{T,R) and Xj^^ G J{S,R) (see section . We can then consider 
the relatively independent joining A^ry ^rX-ks, which is a 3-fold joining of T, 
S and R. The marginal of X-j^j, ®_r Xj^g on X x y is now a joining of T and 
S, called the relatively independent joining of T and S above their common 
factor R, and denoted by fj, (8>/j ly. This joining has the strong property of 
identifying the projections of both systems on the common factor. 

In summary, the u-algebra ^ can be viewed as a sub-u-algebra of both 
£/ and any '^-measurable function can be viewed as a function defined 
on Z, and we have the following integration formula: 

/ f{x)g{y)d{fi<8)Ru)ix,y) = [ E[/(x)|<^](z) E[9(y)|^](z) d/>(z). 

JXxY JZ 

Proposition 3.1. Let ^®rv be the relatively independent joining ofT and 
S above their common factor R. Then 

TTTix) = TTsiy) ^i®RU-a.e. 

Proof. By construction, fi^RV is the marginal on X x y of the 3-fold joining 
Att-t ®r Ayrg of T, 5 and R whose marginal on X x Z (respectively Y x Z) 
is Attj, (respectively X-j^g). Recall that, from Lemma [2^ we have z = itt{x) 
ATTy-a.e. and z = i^siv) A7rg-a.e. Therefore, z = ttt{x) = iTs{y) X-j^j, <Sir Xj^^- 
a.e., hence nxix) = nsiy) n^Rf-a-e. □ 
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Note that what we did here with two systems could have been done with 
any finite or countable family of systems sharing a common factor. 

3.2. Self-joinings and factors. Any factor S of T can be considered as 
a common factor of two copies of T, thus gives rise to a special self-joining 
of T: The relatively independent self-joining of T above its factor S. As 
in ordinary arithmetic of integer numbers, every dynamical system which 
is not reduced to one point has at least two factors: T itself is of course a 
factor of T, and the system reduced to one point is also a factor of T. Note 
that the relatively independent joining of T above T itself is the self-joining 
Aid of T supported on the graph of Id, and that the relatively independent 
joining of T above the factor reduced to one point is the product measure 
fj, fi. Now, we want to show that any other factor of T is incompatible 
with the 2-fold MSJ property. 

Proposition 3.2. Let {X, £/ , fj,,T) be an ergodic dynamical system where 
X is not finite. IfT has another non-trivial factor than T itself, then T does 
not have the 2-fold MSJ property. 

Proof. Let S be a non-trivial factor of T, and consider /i (8)5 /i the relatively 
independent self-joining of T above S constructed from the homomorphism 
vr : X — > y. The system defined by this joining may not be ergodic, 
but if we assume further that T has the 2-fold MSJ property, the ergodic 
decomposition of fi (8)5 /U has the form 

IJL<^SI^ = 6'/Z (g) ^ + (1 - g) Pn^T" , 

new, 

where < 6* < 1, p„ > and X^nez^*" ~ ^- Denote by x and x' the two 
coordinates in the Cartesian square of X, and remember that, by Proposi- 
tion |H3 we have 7r{x) = ■k{x') fj, (8)5 //-a.e. Since 5 is not the factor reduced 
to one point, this would not be possible if appeared in the ergodic de- 
composition, hence 6 = 0. Moreover, if the ergodic decomposition of fi (8>5 fi 
was reduced to Am , -S* would be T itself. Hence there is some n ^ such 
that pn > 0. In other words, there exists some n / such that, under 

(5) ^ 05 = r"x) > 0. 

This implies that, in the factor S, y = S^y with positive probability. But 
S is ergodic (because T is), hence the space Y on which S acts is finite. In 
turn, this implies that there exists at least one y (zY such that 

(6) /x®sM((x' = r"x)n(7r(x) = y)) > 0. 

Now, observe that /u^s fJ- conditioned on (vr(x) = y) is the direct product of 
/i conditioned on (vr(x) = y) with itself. Thus, © implies that fi conditioned 
on (7r(x) = y) has at least one atom. But /x(7r(x) = y) > 0, hence n itself 
has an atom, hence T is periodic. □ 
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Corollary 3.3. IfT is ergodic aperiodic with the 2-fold MSJ property, then 
any non constant stationary process hving in the system T generates the 
whole a-algebra. 

3.3. Disjointness and lack of common factor. We now turn back to the 
case of two dynamical systems T and S sharing a common factor R. As we 
have already noticed in the case of self-joinings, Pr op osition 13 . 1 1 ensur es that 
if R is not reduced to one point, the relatively independent joining of T and 
S above R is not the product measure fi^iy, hence T and S are not disjoint. 
This means that Property (the disjointness property) implies Property^ 
(the lack of common factor). It had been asked by Furstenberg ^ whether 
these two properties were, in fact equivalent. The discovery by Rudolph of 
systems with the MSJ property allowed him to answer negatively to this 
question fIT\ . 

Proposition 3.4. There exist two non disjoint systems S and T without 
any other common factor than the trivial one. 

Proof. Let ^ = (^k) be a stationary process generating a system T with the 
MSJ property, and let (^[.) be an independent copy of (Ck)- We now define 
another stationary process (" by setting 

Ck ■= {Ck,(,k}- 

(This means that (^k tells us which are the two values taken by and , but 
neither which one is taken by ^fc nor which one is taken by ^^.) Denote by 
S the system generated by C: -S* is a factor of the Cartesian product T x T. 
Then T and S are certainly not disjoint, since the two processes ^ and C are 
clearly not independent of each other. However, suppose that S and T share 
a common factor which is not the system reduced to one point. Then, since 
T has the MSJ property, this factor has to be T itself. Therefore, we can 
find a third copy of ^, living in the Cartesian square T x T, and which 
is measurable with respect to the factor u-algebra generated by By the 
3-fold MSJ property of T, either ^" is equal to one of the processes ^ or 
possibly shifted by some constant integer k, or ^" is independent of ('^,'^')- 
The former case is impossible since hence is invariant by the "flip" 
(x, x') 1-^ {x' , x), which is clearly not true for any shifted copy of ^ or The 
latter case is also impossible since (^) and generate the whole cr-algebra 
in T X T. □ 

3.3.1. The fundamental lemma of non- disjointness. There exists however an 
important result which tells us how to derive some information on factors 
from the non-disjointness of two systems. This theorem appeared for the 
first time in year 2000, in two publications [71 115j. 

Theorem 3.5. IfT and S are not disjoint, then S has a non trivial common 
factor with some joining of a countable family of copies ofT. 
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Proof. Start with a joining A of (X, £/ , fi,T) and {Y, i', S): Then S" is a 
factor of (T X 5) A- Consider a countable family of copies of {T x S)\, and 
denote by Aqo their relatively independent joining over their common factor 
S. Since this joining identifies all the projections on Y, it can be viewed as 
a probability measm'e on y x X^, and is characterized by 

Xo^{BxAoX---xAkxXxX---) = [ ]Ex[lAo\y]- ■ -^xiiAMMy)- 

JB 

Observe that this probability measure is invariant under the shift on each 
y-fiber: 

{y,xo,xi,...) I — > {y,xi,X2...). 
Moreover, Aqo conditioned on such a fiber is a product measure: The in- 
finite direct product of A conditioned on y. A relative version of Kol- 
mogorov 0-1 law (precisely stated in ^21; Lemma 9) then tells us that the 
(7-algebra of shift-invariant events coincides modulo Aqo with the cj-algebra 
of y-measurable events. 

Now, let / be a bounded measurable function defined on X, which we also 
consider as a function on X xY and on y x X^ by setting f{x,y) := f{x), 
and f{y, xq, xi, . . .) := f{xo). Applying Birkhoff ergodic theorem to / in the 
system defined by the shift and Aqo gives 

^ n—l 

(7) lim - J]/(xfc) = lExJfly] = EA[/|y] Aoo-a.e. 

k=0 

Since T and S are not disjoint, we could have chosen X ^ ^ ® v, and 
then / such that ^^[/jy] is not constant modulo A. Then this conditional 
expectation generates in 5" a non-trivial factor, which by ((7| can be identified 
to some factor of the joining of a countable family of copies of T defined by 
the marginal of Aqo on X^ . □ 

3.3.2. Application: disjointness of classes of dynamical systems. We get 
from Theorem 13.51 some important disjointness results. Note that some 
properties of dynamical systems are stable under the operations of taking 
joinings and factors. We will call these properties stable properties. This 
is e.g. the case of the zero-entropy property: We know that a factor of 
a zero-entropy system still has zero entropy, and that any joining of zero- 
entropy systems also has zero entropy. Take now any iiT-system S: We know 
that any non-trivial factor of S has positive entropy, hence S cannot have a 
non-trivial common factor with a joining of copies of a zero-entropy T. 

The same argument also applies to the disjointness of discrete-spectrum 
systems with weakly mixing systems, since discrete spectrum is a stable 
property, and weakly mixing systems are characterized by the fact that 
they do not have any discrete-spectrum factor. We thus have the following 
theorem: 

Theorem 3.6. Any zero-entropy system is disjoint from any K -system. 
Any discrete-spectrum system is disjoint from any weakly mixing system. 
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3.4. Joinings and T-factors. The result of Theorem EISl leads to the intro- 
duction of a special class of factors when some dynamical system T is given: 
For any other dynamical system S, call T-factor of S any common factor 
of S with a joining of countably many copies of T. We also call T-factor 
a-algehra any factor o"-algebra of S associated to a T-factor of S. Another 
way to state Theorem 13.51 is then the following: If S and T are not disjoint, 
then S has a non-trivial T-factor. In fact, the proof of this theorem gives a 
more precise result: For any joining A of 5 and T, for any bounded measur- 
able function / on X, the factor cj-algebra of S generated by the function 
EA[/(a;)|y] is a T-factor fi-algebra of S. 

With the notion of T-factor, we can extend Theorem 13. 51 in the following 
way, showing the existence of a special T-factor a-algebra of S concentrating 
anything in S which could lead to a non trivial joining between T and S. 

Theorem 3.7. Given two dynamical systems {X, £/, /i, T) and {Y, v, S), 
there always exists a maximum T-factor a-algehra of S, denoted by ^t- 

Under any joining X ofT and S, the a-algebras si (8){0, and {0, X}®^ 
are independent conditionally to the a-algehra {0,X} ® ^t- 

The proof of the theorem is based on the two following lemmas. 

Lemma 3.8. Let {Bi)i<zi he a comitable family of events in such that 
for all i, Bi helongs to some T-factor a-algehra of S. Then there exists 
a T-factor a-algehra of S containing all the Bi 's. 

Proof. For each i G /, we have a joining Aj of S with a countable family 
(7i,n)nG]N of copics of T, such that C M,n mod Aj. Let us denote 

by Ri the dynamical system defined by A,, and by A the relatively indepen- 
dent joining of all the Ri, i (z I, over their common factor 5. We can see A 
as a joining of S with the countable family {Ti^n){i,n)eix^ and, for each i we 
have 

C (g) mod A. 

(i,n)G/x]N 

We conclude that the factor cr-algebra of S generated by all the .^j's is a 
T-factor cr-algebra, which certainly contains all the -Bj's. □ 

Lemma 3.9. Let ^ he a factor a-algehra of S. If there exists a joining 
\ of T and S under which the a-algehras s/ {0, Y} and {0, X} =^ are 
not independent conditionally to {0, X} then there exists a T-factor 
a-algehra of S, not contained in ^ . 

Proof. The hypothesis of the lemma implies the existence of a bounded 
measurable function f on X such that, on a set of positive z/-measure, 

^x{f{x)\m\ + Ea[/(x)|^]. 

The factor cr-algebra ^' of S generated by the function Ea[/(x)|,^] is not 
contained in but we saw in the proof of Theorem 13 . 51 that is a T-factor 
(T-algebra. □ 
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Proof of Theorem \3. 7| In order to prove the existence of a maximum T- 
factor (7-algebra, we define 

'■= {B G ^ : B belongs to a T-factor cr-algebra of S} , 

and we claim that it is a T-factor cr-algebra. Since (Y, i^, z/) is a Lebesgue 
space, the tj-algebra ^ equipped with the metric d{B,C) := iy{BAC) is 
separable (as usual, we identify subsets B and C of y when 1/(3 AC) = 0). 
There exists a countable family {Bi)i<zi dense in and, thanks to Lemma 
13.81 there exists a T-factor fi-algebra ^ containing all the BiS. By density, 
we have C ^ but, since contains all the T-factor cj- algebras, we 
have = ^ ■ This proves the first assertion of Theorem 13.71 The second 
one is just the application of Lemma 13.91 to D 

The notion of T-factor was introduced in ^Jj in order to study some 
aspects of the weak disjointness of dynamical systems. It was used there to 
prove that if T satisfies the property of being self-weakly disjoint to all order 
k >2 (which is the natural generalization of self weak-disjointness to the case 
of k copies of T), then T is weakly disjoint from any other dynamical system. 
The main open question in this area is whether the self weak disjointness of 
T alone is sufficient to ensure this universal weak disjointness property. 

A characterization of disjointness ? We know from Theorem 13.51 that if T 
and S are not disjoint, then the maximum T-factor of S is not trivial. One 
may ask whether the converse is true, but in fact it is not difficult to find 
a counterexample: Indeed, take {o'n)n&z any stationary process taking its 
values in {0, 1}, and let {Tn)nG'E be an i.i.d. process with r„ taking each 
value or 1 with probability 1/2, independent of (a"„). Then, setting 

Tn ■■= Tn + (T^ mod 2, 

we get another i.i.d. process (f^) with the same distribution as (t„,). The 
(cj„,) process is clearly measurable with respect to the cr-algebra generated 
by (r„) and (f„), which means that the dynamical system S generated by 
{an) is a factor of a self-joining of the Bernoulli shift generated by (t„). 
Therefore, denoting by T this Bernoulli shift, we get that any dynamical 
system 5 generated by a two-valued stationary process is a T-factor. But 
we could have taken for ((T„) any zero-entropy process, which is disjoint from 
any Bernoulli shift (Theorem 13. 6|) . 

However, we can notice that Theorem 13.51 gives a stronger conclusion 
when T and S are not disjoint. Indeed, this hypothesis being symmetric in 
S and T, we get in that case that both the maximum T-factor of S and the 
maximum S'-factor of T are not trivial. In the counterexample given above, 
S having zero entropy, the maximum 5-factor of a Bernoulli shift is trivial. 
So we ask the following question: 

Question 3.10. Can we find two disjoint systems S and T such that both 
the maximum T-factor of S and the maximum S -factor ofT are not trivial? 
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A remark on stable properties. Analyzing the proof of Theorem IH. 71 we can 
observe that the existence of the maximum T-factor a-algebra is in fact a 
corollary of the following: Being a T factor is a stable property (in the sense 
given in Section r3.3.2j) . And indeed, instead of the notion of T-factor we 
could take any stable property and we would get by the same argument 
the existence of a maximum factor cj-algebra satisfying this stable property. 
Some classical factor cr-algebras are particular cases of this type: If we start 
from the stable property of having zero entropy, we get the Pinsker a-algebra 
of the system, and if we start from the discrete-spectrum property, we get 
the Kronecker factor a-algebra. 



We now turn to the relationships between the study of joinings and some 
(weak and strong) mixing properties. We begin by a beautiful proof by 
Ryzhikov of a well-known result concerning weak mixing. 

4.1. Weak mixing and ergodicity of products. There are many differ- 
ent (but equivalent!) ways to define the property of weak mixing. A very 
concise one is the following: We say that T is weakly mixing if the Cartesian 
square T x T is ergodic. It is a standard result in ergodic theory that this 
property implies the seemingly stronger one: 

Theorem 4.1. Suppose that T x T is ergodic. Then for any ergodic S, 
T X S is ergodic. 

The classical proof of this implication makes use of the spectral theory of 
the action of the unitary operators Ut ■ / i— > / o T and Us ■ g ^ g o S 
on the subspaces of i^(/Lt) and T^(z/) of functions with zero mean. It can be 
shown that if T and S are ergodic, the Cartesian square T x 5 is ergodic if 
and only if Ut and Us do not have any common eigenvalue. 

We propose here an alternative approach by joinings to prove Theo- 
r em 14. 11 due to Ryzhikov |2Z1- Let us start with a simple lemma on relatively 
independent joinings. 

Lemma 4.2. Let A he a joining ofT and S, and let X^s ^ he the relatively 
independent self-joining of {T x S)x above its factor S. If the marginal of 
Ai^s A on X xX is the product measure n^n, then A is the product measure 

Proof. For all A G iz/, we have by definition of A 05 A 



JY 

Thus, if the marginal on X x X is the product measure, we get that for all 
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which is possible only if EA[lA|y] is a constant i/-a.e., hence \ = ^® v. □ 

Proof of Theorem \4-l\ Assuming that ^® \s an ergodic self-joining of T, 
we are going to show that for all ergodic S, ® v & Je{T,S), by checking 
that it is an extremal point in J{T,S), and applying Proposition 11.51 

Suppose that for some < p < 1 and some joinings Ai and A2 of T and 
S, we have 

= p\i + (l-p)\2. 

Note that the relatively independent self-joining oi above the factor S 
is nothing but i' fSi fi, which can be decomposed as follows: 

ll®V®ll = p'^\i®s^l + {'^-pf^2®S^2+p{'^-p)^l®S^2+p{'^-p)^2®S^l- 

Since /i (8> /i is ergodic, each term of the RHS must have /i (8> /i as marginal on 
X xX. But this implies that Ai and A2 satisfy the hypothesis of Lemma [4.21 
hence both Ai ad A2 are the product measure ix® v. □ 

4.2. Characterization of mixing by joinings. Strong mixing is also very 
easily characterized in term of joinings: By definition, the dynamical system 
T is strongly mixing (or simply mixing) if VA, B € , 

^i{Ar\T-''B) >M^H5), 

n—foo 

but this means precisely that the sequence (A-rn) converges in the topology 
of joinings to n ® ^. An elegant application of this characterization was 
found and used by Ornstein in ^Hl) where he constructed the first example 
of a transformation with no square root. 

Theorem 4.3 (Ornstein's criterion for mixing). The dynamical system T is 
mixing if and only if T is weakly mixing and there exists some real number 
d > such that 

(8) VA,SGi^, limsup/i(Anr~"B) < 9 fi{A) i_i{B) . 

n— >+oo 

Proof. The fact that mixing implies weak mixing can be proved by the fol- 
lowing argument using joinings: If T is mixing, convergence of (A^^n) to 
II II also implies convergence of {A^^xT)'^) (sequence of self-joinings of 
T X T) to n II li iSi II- And this gives that any T x T-invariant set in 
X X X has measure zero or one, i.e. T is weakly mixing. Condition © is 
obviously necessary for T to be mixing. 

Conversely, let us suppose that T is weakly mixing and that (jSJ holds. 
Then any cluster point A of the sequence (Aj-n) in J(T) satisfies \ < Oix® it., 
hence is absolutely continuous with respect to /i (8) /i. Since T is weakly 
mixing, is ergodic, therefore \ = il® ii. This proves that 

lim Ayn = II® II, 

n— »oo 



i.e. T is mixing. 



□ 
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4.3. 2-fold and 3-fold mixing. We finish tliis presentation by a long- 
standing open question in ergodic theory, and some of its relationships with 
joining theory. Recall that the property of (strong) mixing is also called 
2-fold mixing, because it involves only two subsets of the space. A strength- 
ening of this definition involving 3 (or A: > 3) sets gives rise to the property 
of 3-fold mixing (respectively k-fold mixing): 

Definition 4.4. The dynamical system T is said to be 3-fold mixing if 

yA,B,C (£ £/, 

lim /i(AnT-"Bnr-("+"')C) = fi{A)n{B)fi{C). 

n,m—>oo 

Again, this definition can easily be translated into the language of joinings: 
T is 3-fold mixing when the sequence (Aj^n ^n+m) converges in J^{T) to ^(g) 
H^fi asn and m go to infinity, where A is the obvious generalization 

of Ayn to the case of self-joinings of order 3. 

Whether 2-fold mixing implies 3-fold mixing was asked long ago by Rohlin 
j2Uj . and the question is still open today. However, some important results 
have been established around this problem. Some of them could make us 
think that this implication is false, by exhibiting counterexamples to the 
same question but in a generalized context. In this category, we must in 
particular cite Ledrappier's example in Jl] of a Z^-action which is 2-fold but 
not 3- fold mixing. More recently, it was noticed in j23j that if we consider the 
property of being mixing relatively to a factor cr-algebra (roughly speaking, 
this means that we replace the measure by the conditional expectation with 
respect to this cr-algebra), then we can find examples of 2- fold but not 3- 
fold mixing. On the other hand, there are also many results which go in 
the opposite direction. We know that 2-fold mixing implies 3-fold mixing in 
many cases: For example when T has singular spectrum (Host, |Sj), when 
T is rank one (Kalikow, ^2) or even for finite rank systems (Ryzhikov, 
when T is generated by a Gaussian system (Leonov, JH]; see also 
Totoki [22]). In fact, in many of these works it is shown that a stronger 
result holds for the respective class of systems under consideration, and this 
result concerns joining theory: For finite rank systems, as well as for singular 
spectrum systems, it is proved that any ergodic self-joining A of order 3 which 
has pairwise independent marginals is necessarily the product measure. As 
far as Gaussian systems are concerned, this property has been established 
for an important subclass of the zero-entropy Gaussian processes, called 
GAG ^ni- It is not known whether it can be extended to all zero-entropy 
Gaussian processes. More generally, we may ask the following important 
question: 

Question 4.5. Does there exist a zero-entropy dynamical system T and an 
ergodic joining A € J3 (T) for which the coordinates are pairwise independent 
but which is different from /i (8) ^ (X> /U? 

Why a negative answer to this question would solve Rohlin's multiple mix- 
ing problem is explained by the following remarks: First, if we had a 2-fold 
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mixing but not 3-fold mixing dynamical system T, the sequence (A 
would have a cluster point A 7^ fi^ fii^ fi. Since T is 2- fold mixing, the coor- 
dinates would be pairwise independent for A. And we could find an ergodic 
joining satisfying those two properties by taking an ergodic component of 
A. Second, Thouvenot proved that if we could find a counterexample to 
Rohlin's question, then we could find one in the zero-entropy category. This 
restriction to zero entropy is crucial since we can easily construct examples 
answering Question 14.51 if we drop it: Start from the (1/2,1/2) Bernoulli 
shift T generated by the i.i.d. process (^p)pgZ) taking its values in {0,1}. 
Take an independent copy {(,p)pe'z of this process and set 

:= + i'p mod 2. 

Then the three processes ^, and ^" have the same distribution, are pairwise 
independent, but the self-joining of T of order 3 that we get in this way is 
not the product measure. 

Observe also that a negative answer to Question 14.51 would imply another 
important result in joining theory: It would show that 2-fold MSJ implies 
fc-fold MSJ for all k>2. Indeed, any 2-fold MSJ but not 3-fold MSJ would 
have an ergodic joining A of order 3 for which the coordinates would be 
pairwise independent, but not independent. 

The purpose of the following is to show that, if ever we could find some 
system satisfying the requirements of Question 14. 5| it must be of a different 
nature than the construction of Ledrappier for an action of , or than the 
relative example given in [2^]. Indeed, each of these two constructions leads 
to some "joining" A, for which we get 3 pairwise independent coordinates xi, 
X2 and rE3, and for which the global independence is denied by the following 
property: We have some random variable ^ defined on X, taking its value 
in {0, 1} (each value being taken with probability 1/2), and satisfying 

(9) ^(xs) = i{xi)+i{x2) mod 2 (A-a.e.). 

There are two ways of interpreting equation Q: First, the random vari- 
able ^ takes its values in a finite alphabet A, and there exists a function 
/ : Ax A such that 

^(X3) = /(^(xi),e(x2)) (A-a.e.). 

Second, the random variable ^ takes its values in a compact abelian group 
G, and we have 

= i{xi)+^{x2) (A-a.e.). 

We want to show now that there cannot exist an example answering Ques- 
tion of either of those types, by proving the following two propositions. 

Proposition 4.6. Suppose that there exist some joining A G J3{T), with 
pairwise independent coordinates, and some random variable ^ : X ^ A 
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(finite alphabet) such that 

e(^3) = /(^(xi),e(^2)) (A-a.e.). 

Then 

• either the factor of T generated hy ^ is periodic, 

• or this factor has an entropy at least log 2. 

Proposition 4.7. The dynamical system T is assumed to be ergodic. Sup- 
pose that for some joining A € J3 (T) with pairwise independent marginals, 
there exists some measurable map : X ^ G (compact Abelian group) 
which is not fi-a.e. constant, and such that 

e(x3) = C{xi)+^{x2) (A-a.e.) 
then T has positive entropy. 

Lemma 4.8. Let ^1, ^2 and ^3 be 3 random variables taking their values 
in the finite alphabet A, identically distributed, pairwise independent, and 
such that there exists a function f : A x A ^ A satisfying 

6 = (a.e.) 

Then their common probability law is the uniform law on the set of values 
which are taken with positive probability. 

Proof. Taking a subset of A instead of A if necessary, we can assume that 
for all i & A, 

Pi := P(ei = i)>0. 

For all i and k in A, there exists a unique j = jfc(i) such that /(i, = k. 

Moreover, we have 

Pk = ^PiP{£,3 = = i) = '^PiPjkii)- 

i i 

In other words, vr = Mtt where vr := {pi)i(zA and M is the bistochastic 
matrix whose entries are rrik^i ■= Pjf,{i)- Taking the limit as s — > 00 in the 
equality vr = M^tt, we get that vr has to be the uniform law on ^. □ 

Proof of Proposition \4.6\ From the preceding lemma, we know that ^ is uni- 
formly distributed on the subset of values which are taken with positive 
probability. But for any m > 1, Lemma 14.81 also applies to 

C"' ■■ (c(x),e(rx),...,e(r-"M)- 

Let io, • • . , im~2 in A be such that 

Then, the conditional probability law of ^(T™'~^x) knowing the above event 
is uniform on the set of values to which it assigns a positive mass. More- 
over, we easily check that the cardinal am of the support of this conditional 
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probability law does not depend on the choice of io, • • • , im-2, and satisfies 
am < flm-i- Thus, 

• either the sequence (am) reaches 1, and in this case the factor gen- 
erated by ^ is periodic; 

• or am is always greater than or equal to 2, and then the entropy of 
the factor generated by ^ is at least log 2. 

□ 

Lemma 4.9. Let ^i, ^2 and ^3 be 3 random variables taking their values in 
the compact Abelian group G, identically distributed, pairwise independent, 
and satisfying 

6 = 6+ 6 (a.e.) 

Then their common probability law is the Haar measure on some compact 
subgroup of G. 

Proof. Denote by v the probability law of ^1 on G. For z^-almost every 
g G G, is invariant under the addition of g, since knowing ^ = g, both ^1 
and = + 5 are distributed according to v. Let us define 

H := {g £ G : i' is invariant under the addition of g}. 

This is a closed subgroup of G containing the support of z/. Thus z/ is a 
probability law on H, invariant under every translation on H: v is the Haar 
measure on H. □ 

Proof of Proposition \4- 7| Let us consider ■0 : X — > G^ defined by 

^Pix) := iaT^x))n&. 

From Lemma 14.91 the probability law of ip{x) is the Haar measure on some 
compact subgroup H of G^. Moreover, H is invariant by the shift, denoted 
by cj, on G^. The restriction of a to -ff is therefore an ergodic endomorphism 
of H, which is a factor of T (This is the factor generated by ^). A theorem 
by Juzvinskii P| tells us that any such ergodic group endomorphism has to 
be a ii'-system; hence T has positive entropy. □ 

Observe that the two situations exposed in Propositions 14.61 and 14.71 can 
be seen as particular cases of a third one, from which I do not know whether 
some similar conclusion can in general be derived. 

Question 4.10. What can be said about a dynamical system T satisfying 
the following: There exists an ergodic self-joining A G J^iT), with pairwise 
independent marginals, and a map f : X x X ^ X such that 



X3 = f{xi,X2) (A-a.e.)? 
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